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Abstract. Let (C, d) be an ultrametric Cantor set. Then it admits an isometric embedding 
into an infinite dimensional Euclidean space [30]. Associated with it is a weighted rooted tree, 
the reduced Michon graph T [28'. It will be called /-embeddable if there is a bi-Lipshitz map 
from (C, d) into a finite dimensional Euclidean space. The main result establishes that (C, d) 
is /-embeddable if and only if it can be represented by a weighted Michon tree such that (i) 
the number of children per vertex is uniformly bounded, (ii) if k denotes the weight, there are 
constants c > and < 5 < 1 such that k{v)/k{u) < c5'''"'"' where w is a descendant of u 
and where d{u, v) denotes the graph distance between the vertices u, v. Several examples are 
provided: (a) the tiling space of a linear repetitive sequence is /-embeddable, (b) the tiling space 
of a Sturmian sequence is /-embeddable if and only if the irrational number characterizing it 
has bounded type, (c) the boundary of a Galton- Watson random trees with more than one child 
per vertex is almost surely not /-embeddable. 



1. Introduction 

Embedding isometrically compact metric spaces into a Euclidean space is an old problem 
|17[ [HI [551 Elj that has been investigated by several communities, in Mathematics and in Com- 
puter Science as well. There are elementary examples of finite metric space for which such an 
embedding is impossible: the simplest example is the space Ki^s = {0,1,2,3} (see Fig. [l]and 
j2T]) with metric d(0, i) = 1 for i = 1, 2, 3 and d{i,j) = 2 if z / j G X \ {0}. This is the path 
metric on the graph with three vertices and the three internal edges. Then there is no isometric 
embedding of {X, d) in any Euclidean space. More generally. 

Theorem 1 (Schoenberg [32]). If{X, d) is a finite metric space, let x & X be a point and let A{x) 
be the matrix A{x) indexed by X\ {x} with elements A{x)y^z = {d{x, y)'^ + d{x, z)^ — d{y, z)'^)/2. 
Then {X, d) admits an isometric embedding in M" if and only if 

(i) A{x) is a positive matrix and 

(ii) the rank of A{x) is less than or equal to n. 

Testing whether A{x) is positive can be done through the Cholesky algorithm, which provides 
also a computation of the rank. This method is amenable to numerical calculation to analyze 
data sets (see t27ii8j for instance). 

The simplest type of infinite metric spaces for which the isometrical embedding problem can 
be extended are Cantor sets. A Cantor set is a compact Hausdorff space that is completely 
disconnected and has no isolated point. An old Theorem of Brouwer [7j claims that all Cantor 
sets are homeomorphic to the set {0, 1}^ of infinite sequences of O's and I's, endowed with the 
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Figure 1. The space Ki^s with d{0,i) = 1, d{i,j) = 2 is not embeddable 

product topology. While there is only one Cantor set, as a topological space, there are many 
metrics on it describing the topology. The subclass of ultrametrics turns out to be as natural for 
Cantor sets as the Euclidean metric on M". This was investigated in the eighties by Michon [28] 
who described all possible ultrametrics through the concept of weighted rooted tree. In a recent 
work [30j, J. Pearson and the first author, using this formalism, proved the following embedding 
result 

Theorem 2 (see |30j). Any ultrametric Cantor set can be embedded isometrically into an infinite 
dimensional Euclidean space with a countable basis. 

The algorithm behind this result also applies trivially to finite ultrametric spaces. Actually, in a 
finite space X, all metric are equivalent, so that replacing the metric by the smallest ultrametric 
which dominates it, allows to use the method of |30j to find a quasi-isometric embedding in a 
finite dimensional Euclidean space. The method however gives a dimension n which is at least 
the number of points of the space minus one. On the other hand, given any finite set F on 
the real line with the same cardinality as X and given any one-to-one map from X to F, the 
metric on X induced by the Euclidean metric on F is equivalent to the original one, so that 
quasi isometric embedding of X in M is always possible. As it turns out this is a subclass of sets 
for which the main result (Theorem [s]) of the present work applies. 

One of the questions suggested by Theorem [2] is whether the dimension of the embedding space 
can be made finite. There is an obvious obstacle to such an embedding: since the Cantor set 
is ultrametric, it cannot be embedded isometrically into M", because the Euclidean metric is 
not an ultrametric. However, it is sufficient that the two metrics be equivalent. Again there 
is an obvious obstacle, namely if such an embedding exists, then the Hausdorff dimension of 
(C, d) and of its image (j){C) are equal and therefore diniH{C, d) < n. So the question has to be 
rephrased as follows 

Definition 1. An ultrametric Cantor set {C,d) will be called f -embeddable if there is n gN, a 
positive real number c and a map (j) : C ^ M" such that 

-d{x,y) < \\(t){x) - (t){y)\\ < cd{x,y) , \/x,y eC . 

c 

Such a map will be called a quasi-isometry or a bi-Lipshitz map, or, for short, in this paper, an 
/-embedding. Then n will be called the dimension of the embedding. 

Acknowledgements: J. B. thanks A. Grigor'yan and his group at the CRC701, University of 
Bielefeld for giving him the opportunity to present this result prior to publication. A. J. thanks 
J. Savinien for discussions on this embedding problem. 
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2. The Main Result 

A graph 9 is a family S = C^^S-), where V, cahed the set of vertices, and £, called the set of 
edges, are countable sets. They occur with two maps (r, s) : £ — >• V, everywhere defined, called 
the range and the source. A finite path in S is a sequence 7 = (ei, • • • , e„) of edges such that 
r{ej) = s(ej+i). The integer n = \^\ is called the length of 7. The maps r, s extend to finite 
path by r(7) = r(e„) and 5(7) = s(ei). If x = 5(7) and y = r{'y) then 7 connects x to y and this 
will be denoted by 7 : x — >• y. If f = r(ej) = s(ej+i), then 7 is said to pass through v. A path 
is a loop if r(7) = 5(7). The relation "x is connected to y", defined by "there is a path 7 such 
that either 7 : x — >• y or 7 : y — )• x", is an equivalence. The equivalence classes of S are called 
the connected components. If there is only one connected component, S is called connected. A 
path 7 is a cycle if it is a non-trivial path 7 : x — t- x. In particular, a loop is a cycle. 

A tree T is a connected graph with no cycle. A rooted tree is a tree with a pointed vertex 
called the root. In such a case, given a vertex v G V there is a unique path connecting the root 
to V. Then \v\ will denote the length of this path and will be called the generation of v. This 
induces an order between the vertices as follows: two vertices v, w satisfies w ^ v whenever the 
unique path from to u; is passing through v. Then v is called an ancestor of w, while w is 
called a descendant oi v. In such a case, v is the father of w (or, equivalently, w is a child of v) 
whenever the unique path from v to w contains only one edge. A vertex is dangling whenever 
it has no descendant. A vertex is branching if it has at least two children. Given a rooted tree 
T = (V, £,0), let V C V be the set of branching vertices, which have infinitely many branching 
descendants, li w G V , define its predecessor v to be the closest ancestor in V. This leads 
to the reduced tree T' = ("\7',£',0') where e G £' is a pair {v,w) £ V x V , such that v is the 
predecessor of w. In such a case s{v, w) = v while r(u, w) = w. The new root 0' is either if the 
latter belongs to V or its closest descendant belonging to V. All the trees considered here will 
be reduced, unless specified otherwise. A rooted tree is called Cantorian whenever each vertex 
has only a finite number of children and each vertex admits a descendant having more than one 
child. 

Let T be a rooted tree. Its boundary 87 is the set of maximal paths starting at the root. If T 
has dangling vertices, the latter are representatives of a point in 87. So 87 is the union of the 
dangling vertices and of the set of infinite maximal paths. Given a vertex v, [v] will denote the 
set of paths in 87 passing through v. The family {[v]; v £ V} defines a basis for a topology on 
87 which makes this space completely discontinuous. In particular 87 is a Cantor set if and only 
if 7 is Cantorian [30] . Conversely any Cantor set can be described through a rooted tree which 
is not necessarily unique. If x, y G 87, then x A y will denote the youngest common ancestor 
of X and y. A weight of the rooted tree T is a map k : V — )• (0, 00) such that (i) ii w ^ v then 
k{w) < k{v) and (ii) lim|^|_i.oo k{v) = 0. A weight k leads to an ultrametric df^ on 87 by setting 

dK{x,y) = k{x Ay) , x,y £ 87 . 

Conversely the Michon theorem [2_8j establishes that any ultrametric Cantor set is isometric to 
the boundary of a reduced rooted Cantorian tree endowed with a weight. Such a tree will be 
called a Michon tree. The main result can be now expressed more precisely 

Theorem 3 (Main Result). An ultrametric Cantor set {C,d) is f -emheddahle if and only if it 
can he represented by a Michon tree 7 with weight k satisfying the following two conditions 
(i) the number of children of a vertex is uniformly hounded, 
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(a) there are constant c > and < 5 < 1 such that 



k[v) 

The proof that these conditions are necessary is new, to the best of the authors knowledge. 
However the proof that they are sufficient is deeply inspired by a recent paper by J. Savinien 
and the second author [22], in which a similar result was obtained for what the authors are 
calling self similar Cantor sets. Examples of such Cantor sets can be obtained as tiling spaces of 
substitution tilings ^31j . Thanks to the existence of a substitution, the conditions (i) and (ii) are 
easy to implement and the proof reduces to show the existence of an /-embedding. This proof 
is actually transferable to the present situation and will show that the two conditions (i) and 
(ii) are sufficient. In |22j though, the authors have an additional result, namely the dimension 
of the embedding can be chosen to be the smallest integer larger than the Hausdorff dimension 
of {C,d). Such a result cannot be expected to be true for a general ultrametric Cantor sets. 
However, the following result will be shown in Section [5] 

Proposition 1. // {C,d) is an ultrametric Cantor set for which the conditions (i) & (ii) of 
Theorem hold, then 

dimHiC,d) < 

where M is the maximum number of children per vertex. Moreover the smallest dimension of 
an f -embedding is at most equal to In (Mc + 1)/| ln5\. 

Several consequences of the Main Result will be proved at the end of this work. Given a doubly 
infinite sequence = {un)n<^N, of letters taken from a finite alphabet n„ G A, the tiling space 
of ^ is the set of all doubly infinite sequences sharing the same dictionary of finite words. It is 
customary to define a metric on the tiling space, called the combinatorial metric, as follows: if 
C = {'>J'n)neN,V = (^'n)neN are two sequences their distance is d{^,r]) = {N + where is 

the largest natural integer > such that Un = Vn for |n| < N. 

The first consequence of the Main Result concerns the tiling space of a linearly repetitive se- 
quence. A sequence x = {un)neN £ is linearly repetitive whenever there is K > 1 such that 
for each n E N and all pair of finite subwords v, w with length \v\ = n, \uj\ = Kn, then v occurs 
subword of w. 

Theorem 4. The tiling space of a linearly repetitive sequence is f -embeddable. 

It is possible to extend the notion of linear repetitivity to tilings in higher dimension [1]. The 
proof of the previous result, however, requires a stronger definition to be extended in higher 



dimension, related to forcing the border in a controlled way (See Section 6.3). This suggest that 

Conjecture 1. There is a suitable definition of linearly repetitive tiling in such that the 
transversal of any such tiling space is f -embeddable 

In a similar way, the next consequence concerns Sturmian sequences. These are doubly infinite 
sequences such that number of words of length n it contains equals n + \. A classical theorem 
(see, for instance, [5j) asserts that, for each such sequence x = {xn)n&z-, the alphabet contains 
two letters, usually denoted by {0, 1} and there is an irrational number a G (0, 1) such that 
Un = [x — no] — [x — (n + l)a] for some x G (0, 1]. The following theorem is a corollary of the 



Main Result, and is proved in two parts: "if" (Proposition 12), and "only if" (Proposition 13). 
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Theorem 5. The tiling space of a Sturmian sequence x associated with the irrational number 
a G (0, 1) is f -embbedable if and only if a has bounded type, namely if and only if its continued 
fraction expansion has bounded partial quotients. 

A possible extension of this result is provided by the tiling spaces of quasicrystals obtained by 
the cut-and-project method PU]. Namely let Z" seen as a subset of the Euclidean space M"". Let 
A be the unit cube [0, 1]" in M". Let then be a linear subspace of M" of dimension d, with 
irrational orientation, namely such that i^y Pi Z"' = {0}. Then S = = (A + £'j|) Pi Z'^. 

The set C = C{E^^) obtained as the orthogonal projection of in i^y gives a model for the 

atomic positions of a possible d-dimensional quasicrystal. The set C has several properties, in 
particular it is a Delone set of finite local complexity (FLC) ^25j. A local patch is a finite subset 
of iJ^ii of the form (£ — x) n -6(0; r] for some x G C. Here .6(0; r] denotes the closed ball centered 
at with radius r. Then FLC means that, for each r > the set of patches of radius r is finite. 
The tiling space of C is again the set of all Delone sets in , containing the origin sharing the 
same family of local patches. The combinatorial metric on the tiling space is defined in a similar 
way. Namely, if C and C are two FLC Delone sets, then d{C,C') is the inverse 1/R where R 
is the largest r > such that the patch of radius r at the origin coincide in both C and 
namely such that C D B{0; r] = C B{0; r]. By analogy with the previous result the following 
conjecture is expected 

Conjecture 2. For Lebesgue almost all d-dimensional linear space -Ey G , seen as points in 
the Grassmanian manifold of d-dimensional subspace ofW-, the tiling space of C{E\\), endowed 
with the combinatorial metric, is not f -embeddable. 

The last consequence analyzed here concerns random trees obtained from a Galton- Watson 
branching process |19l [3j (see [23] for a well documented fascinating history of this problem) . 
Let p be a probability defined on the set N = {0, 1, 2, • • • } of natural integers. The GW^-tree is 
constructed inductively, starting from the root, as follows: if u is a vertex, it has children 
where My is a random variable with distribution p and is independent from the M„ where u is 
any prior vertex constructed so far. It is a classical result in probability, partly proved initially by 
the Reverend Watson [35], called the Galton- Watson-Haldane-Steffensen critical theorem [23] . 
that if the average number of children {My) is less than or equal to one, the tree obtained in this 
way is almost surely finite (extinction). In particular its reduced tree is empty. On the other 
hand, as was eventually proved by Steffensen in 1930, if {My) > 1, the probability of extinction 
is less than one. However, the random tree produced in this way is likely to have dandling 
vertices. 

One way to avoid such a property is to force every vertex to have at least two children. This can 
be done by demanding that Po = pi = 0, namely that p be supported by [2, oo) C N: then the 
Galton- Watson tree is automatically reduced and this model can be called a Reduced Random 
Tree. The first consequence of the Main Result is the following 

Proposition 2. Let p be a probability on the set [2, oo) C N. If it has an infinite support, then 
the number of children per vertex is almost surely unbounded. In particular, for any weight, the 
boundary of the reduced Galton- Watson tree associated with this probability is almost surely not 
f -embeddable. 

In such a case a random weight can be added in the following way: a family (Ai,)t,gv of i.i.d. 
supported in [0,1], with common distribution p is defined. A weight is defined inductively by 
setting k(0) = 1 and k{v) = Xyn{u) is f is a child of u. In order that this defines a weight, it is 
required that p{0} = and p{l} < 1. Then 
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Theorem 6. Letp he a probability on the set [2,oo) C N and let p be a probability on [0, 1] such 
that p{0} = and p{l} < 1. If m = {M^) < p{l}~^ , let Sm he the unique solution of {X^)m = 1. 
Then the Reduced Random Tree T produced by the Galton- Watson process associated with p and 
endowed with the random weight associated with p gives rise to an ultrametric Cantor set 87 
with Hausdorff dimension Sm almost surely. In addition, its Hausdorff measure exists almost 
surely and is a random probability measure. 

3. Necessary Conditions for /-Embeddability 

The first result concerning the existence of an /-embedding is provided by 

Proposition 3. Let 7 be a weighted Michon tree with weight k such that d7 endowed with its 
ultrametric d^ is f -emheddahle. Then the number of children per vertex is uniformly hounded. 

Proof: By assumption, there is n E N, a constant c > 1 and a map (f) : 87 — )• M" such that 

(1) ^d^{x,y)<U(x)-(^[y)\\<cd^(x,y), yx,y d 87 . 

c 

Let then u G V be a vertex. By definition k{v) represents the diameter of [v\ in 87. Therefore 
diam{ (/)([?;])} < ck{v). Moreover, if w,w' are two distinct children of v, the distance between 
[w], [w'\ is equal to K{y) as well, thanks to the definition of d^. Therefore, given x^ E [w] for 
each child of v, the open balls B{xw, n{v)) are pairwise disjoint, because is an ultrametric. 
These balls are contained in [v\. Thanks to eq. ([T]) it follows that the euclidean open balls 
Bw = B{(f){xw), K,{v) / c) are pairwise disjoint and contained in a Euclidean closed ball B^ of 
radius ck{v). Let be the number of children of v. Let ujn be the volume of the Euclidean 
ball of radius 1 in M". Then the sum of the volumes of the Byj''s is at most equal to the volume 
of B^, leading to 

M„<(c2)", 

□ 

Remark 1. The Proposition [3] shows that if two Michon trees represents two equivalent ultra- 
metrics d, d' on the Cantor set C, then they both have a bounded number of children per vertex. 

□ 

Proposition 4. Let 7 be a Michon tree with a weight k and a uniformly bounded number of 
children per vertex. If the corresponding ultrametric Cantor set {87, d^) is f -embeddable, then 
there are Q < 9 <1, and c > 1 such that for any pair {v, w) of vertices such that v is an ancestor 
ofw 

^ <ceH-H 
k{v) 

Proof: Let M be the maximum number of children per vertex in 7. It is finite by assumption 
and since T is reduced, M > 2. Let < 5 < 1 be chosen. For each vertex v G V, let Desc5(w) 
be the set of descendants w of v such that k{w) < 6n{v). This set is not empty since the 
weight converges to zero along any path passing though v. Let Ss{v) be the maximal elements 
in Desc5(t>), so that w £ Ss{v) if and only if is a descendant of v such that k{w) < 6k{v) and 
its father w satisfies k,{w) > 5k,(v). 



k(v) \ 



< UJn {ck{v)Y , 
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Then a reduced graph T5 = (Vs, £5,0, ks) is constructed as follows: 

(i) its root is the same as the root of T, 

(ii) its set of vertices is a subset C V constructed inductively as follows: 

(a) Vs,i = SsW C V, 

(b) Vs,n+i = U«ev,,„ Ss{u), 

(iii) an edge e G £5 is a pair {v,w) where v £Vs and w £ Ss{v), 

(iv) its weight is given by ks{v) = k{v) ii v £ Vs- 

Let ds be the metric defined on d7s by the weight Kg. It follows, from the construction of 7s, 
that every infinite path x £ 87 starting at the root defines also a unique infinite path f{x) E d7s 
starting at the same root, by selecting, among the vertices defining x, the ones belonging to Vs- 
From this it follows that / is actually invertible and continuous. Hence, identifying 87 and 87s 
with the Cantor set C, / is just the identity map. 

If X, y are distinct paths in 87, then the vertex x Ay £ V may not belong to Vs- However, if v 
is its least ancestor in Vs, its weight satisfies always 

5k{v) < k{x Ay) < k{v) . 

Moreover, v is the least common ancestor of x and y inside the tree 7s, so that ds{x, y) = k{v) 
leading to 

5<#4<i. 

d5{x,y) 

Thus, the two metrics d,ds on the Cantor set C, are equivalent. In particular, {C,d) is em- 
beddable if and only if (C, ds) is. Hence, thanks to Proposition [sj the maximum number Ms 
of children per vertex in 7s is finite. It follows that the number of vertices in Ss{v) is bounded 
by Ms whenever v £ Vs- Let then r{v) be the maximal generation difference, in T, between v 
and an element w £ Ss{v). Since the path from v to w cannot contains more that Ms points 
r{v) < Ms- 

Let v,'w be two vertices in V such that w < v- Let vq {resp. wo) be the least ancestor of 
V {resp. w) belonging to Vs- By construction wq is a descendant of vq- If vq = wq, then 
Sk{vo) < k{w) < k{v) < k{vq). Hence 

k(w) S^M-H 
k{v) S^^s 

If now Wq / Vq, then, by construction 6Hi{wo) < k{w) < k{wo), and k{wo) < 6'^k{vo) where n is 
the generation difference between vq and wq in 7s- Hence, the generation difference between vq 
and Wq , in T, is at most nMs . Thus, ii\vo\ denotes the generation of in T, n > ( | t(;o | — bo I ) / ^5 > 
so that 



k{wo) 



< §i\wo\-\vo\)/Ms _ 



k{vo) 

Since |it;| — \v\ < |it;| — \vo\ < \w\ — \wo\ + \wo\ — \vq\ < Ms + |wo| — \vo\, this implies 

/CH < ^(^'O) < 1 ^{H-\v\)/M, _ 

k{v) 5k{vq) (5^ 

Since 7 is reduced, it follows that Ms > M > 2. Consequently, the conclusion holds with 
c < 5-*^^ and if (5^*^^ = 0. □ 
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4. The Embedding Theorem 

While the previous section showed the the conditions (i) and (ii) in Theorem [s] are necessary, 
this Section will prove that they are sufficient. The proof is just and adjustment of the proof of 
Savinien and the second author in [22J. 

Proposition 5. Let 7 be a Michon tree with a weight k with a bounded number of children per 
vertex and so that there are c > 1 and < 6 < 1 for which 

(2) ^<c^lH-bl ywT^v, v,w,eV. 

K{V) 

Then {87, d^) is f -embeddable. 

Proof: (i) Let M be the maximum number of children per vertex in T. Let g : V — )• 
{1, 2, • • • , M} be a map such that its restriction to the set of children of v be one-to-one for all 
vertex v £ V. Such a map consists simply in giving a numbering to each child of each vertex. 
Hence 1 < g{v) < M for all v. Let now L be a natural integer. The choice of L will be made 
later. Any point x G 87 will be represented by the sequence x = {vn)^=o of its vertices in T, with 
the convention that f„ has generation n. Then let ip : 87 — t- be defined by = {cl)r{x))r=i 
with 

oo 

cj)^(x) = ^ g{VjL+r) K{VjL+r-l) , 1 < r < L . 

j=0 

Since g{vn) < M and n{vn) < c^" it follows that the series converges absolutely and uniformly 
with respect to r and x. 

(ii) Let now x,y £ 87 with x ^ y with x = (fn)5^Lo ^^'^ V — (^n)$^Lo- Then let joi '^o be natural 
integers such that ja; A y| = j^L + tq — 1 and 1 < tq < L. It follows that the paths x and y share 
the same vertices until the generation |x A y| and they split at the next generation. Therefore, 

oo 

(t)r{x) - (l)r{y) = ^ {g{VjL+r)l^{vjL+r-l) - g{WjL+r)l<'{WjL+r^l)) 
3=30+1 

+ X{r > ro) igivjoL+r)KiVjoL+r-l) - giWjoL+r)K{WjoL+r-l)) 

It should be remarked that 



-9{WjL+r)K{WjL+r-l) < g{VjL+r)K.{VjL+r-l) - 9{WjL+r)n{WjL+r-l) < 9{VjL+r)K.{VjL+r-l) 

leading to the inequality (see eq. ([2|), 



\g{VjL+r)fiiVjL+r-l) - giWjL+r)K{WjL+r~l)\ < M k{x A y) C ^(^"^O^^+^-'^O 

Therefore 

Mc 

(3) \(l)r{x) - (l)r{y)\ < Y^-^ k{x A y) . 

Since k{x Ay) = d^ix, y), this map is Lipshitz continuous. On the other hand, ||(/)(x) — (/>(y)|| > 
{(prix) — (/'r(y)| for all 1 < r < L. In particular, using again eq. ([S]) and since Vj^i+ro / ^jo^'+^'o' 
it follows that 
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, , , , , , ,, , , McO^ , 
> \(t>ro{x) -(t>ro{y)\ > K{xAy) - ^ _ i^\xAy). 

If L is chosen so that 9^ < {Mc + l)""*^, it follows that this map is actually bi-Lipshitz. □ 

5. Hausdorff Dimension 

The minimal value of the embedding dimension L found in the proof of Proposition [5] is not as 
good as what was found in [22] for self similar Cantor sets. This is because in this case, more 
restrictions are put on the weight, in particular a lower bound of a type similar to the upper 
bound given in eq. In such a case it was shown that the minimal embedding dimension is the 
smallest integer larger than the Hausdorff dimension of the ultrametric Cantor set considered. 
This addresses the question of computing the Hausdorff dimension of the ultrametric Cantor set 
(07, d^). 



5.1. Hausdorff Dimension: the main result. The definition of the Hausdorff dimension |16) 
starts with the following construction: given an open cover U of C = 87 and given s £ [0, oo), 
let W(U) be defined by 

H'{U) = diam([/)^ 
In addition, let diam(U) = sup^/gu diam(C/). Then for < (5 < 1 let T-l'^{C) be defined by 

(4) H|(C)= inf n'(U) 

^ diam(U)<<5 

It follows that S' <5 ^ nUC) < ni,{C). In addition, if a > then < S^nHC) < 

■H|(C). Consequently, 

(i) the limit lim^^o'^KC) = "^^(0) exists in [0,+oo) U {+oo}, 

(ii) there is a unique sq such that if s > sq then ^.^{C) = whereas for s < sq, T-L^{C) = +oo. 
This unique value sq is precisely the Hausdorff dimension sq = diuiH (C, d) . In order to give a 
better formula, the following definition will be nedded 

Definition 2. A finite subtree T of 7 is a tree graph T = (Vr, S-r, 0) where 

(i) '^r C V is finite and contains the root 0, 

(ii) for every w S Vp every vertex of the unique path 7^, : — )• w in T belong to Vr , 
(Hi) £r is the subset of edges in £ of the form {v, w) where both v, w are in Vp. 

dV will denote the set of endpoints. The finite subtree T is called full if for any vertex v G r\3r, 
each child of v is a vertex ofV. 

The main result of this Section is 

Theorem 7. Let 7 be a reduced Michon tree with weight k. For any 6 > let &s(7) be the set 
of all full finite subtrees of 7 such that max^g^r t^i^) < 6. Then 

m{d7) = inf y K{vy . 
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Figure 2. A full finite subtree and its extremal vertices 



Corollary 1. Let 7 be a reduced Michon tree with weight k. If the maximum number of children 
per vertex is M and if there are c > 1 and < < 1 such that eq. holds, then 

, , InM 
dimH{d7,d^) < . 

I mt^l 

Given a vertex v let K{v) be the product of Af(u)~^ over the ancestors of v. Then K defines a 
weight called the Kraft weight. This leads to the following elementary result 

Corollary 2. Let 7 be a reduced Michon tree with its Kraft weight K. The Hausdorff dimension 
of (9T, dK) is exactly one. In addition, the Kraft weight defines a probability measure on 87 
which coincides with the Hausdorff measure o/(9T, dj^). In particular (87, dx) is f-embeddable 
if and only if 7 has a bounded number of children per vertex. 

The proofs of the last three results will be the content of the following Sections. 
5.2. Partitions by Closed and Open Sets. 

Definition 3. A c-partition of the Cantor set C is a finite family 7 = (Pi, P2, • " " > Pr) where 
the Pi 's are closed and open (clopen) subsets of C with union C and with empty pairwise inter- 
sections. 

That c-partitions exist as a refinement of any open cover, is essentially equivalent to the definition 
of a Cantor set. This is because the topology admits a basis of clopen sets. Hence every cover 
admits a refinement made of clopen sets. By compactness, every clopen cover admits a finite 
subcover. At last, the intersection or the union of two clopen sets are clopen sets as well. 
Therefore the minimal elements of the cr-algebra generated by a finite clopen cover are themselves 
clopen and they make up a partition. This argument shows that any open cover admits a refining 
c-partition. Conversely a compact space with this property is completely disconnected and, if it 
has no isolated point, it is a Cantor set. 

Proposition 6. Let 7 be a Michon tree with weight k. The extremal vertices of a full finite 
subtree make up a c-partition of C . Conversely, for any c-partition 7 of the ultrametric Cantor 
set (87, df^), there is a full finite subtree T such that each element of 7 is a union of extremal 
vertices ofV. 
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Proof: (i) Let T be a full finite subtree of T. li v,w £ dV then neither of them is the descendant 
of the other, just because they are both extremal. Hence [fin [w] = 0. Since F is full, the brothers 
of any extremal vertex are all vertices in T. Let x G d7 be an infinite path x = {vn)neM £ 97. 
Since vq = f!>, there is a unique no G N such that f„ G for n < uq and Vn ^ "^^r for n > uq. 
If were not an extremal vertex for L, then, since F is full, its child Vno+i should be in F as 
well, a contradiction. Therefore Vng G dV and x G [vnol- Consequently, the family {[v]; v e dF} 
is a c-partition. 

(ii) The sct{[i;] ; G V} is a basis for the topology of 87. Given any clopen set P C 87, any 
point a; G P admits a neighborhood of the form [vx] C P. The family {[vx] ; x G 87} is an 
open cover of P. Since P is closed, it is compact so that a finite subcover can be extracted. 
But the intersection [v] H [w] is either empty or, if not, one of the two [v], [w] is included in the 
other. In the latter case, remove the smallest from the family. Proceeding this way, leads to a 
subcover which is actually a c-partition of P by vertices of the Michon graph. Let now J" be a 
c-partition of 87. Let Vy be the set of vertices generating the c-partitions of the elements in J. 
Let also F be the finite subgraph of T made with vertices and edges of the finite paths joining 
the root to one of the vertices in Vy. Then F is a finite subtree by construction. Actually, it is 
full: for indeed if not, it has an extremal vertex v having one of its brother w not in Vp. But 
this in not possible because, T being a cover, [w] is covered by T and thus has a non empty 
intersection with some P G CP. Since [w] does not intersect any of the [u] with u G Vy, there is 
a contradiction with the construction of Vj>. Hence w must belong to as well. □ 

Lemma 1. Let (X,d) be an ultrametric complete space. Then every open ball is closed and 
every closed ball is open. 

Proof: Let x e X and let r > 0. The ball B{x; r) = {y G X ; d(x, y) < r} is open. If z G 
B{x;r) then there is y G B{x;r) with d{y,z) < r. Therefore d{x,z) < max{d(x, y), 2;)} < r 
so that z G B{x;r). Conversely, let B{x;r) = {y e X ; d{x,y) < r}. If y G B{x;r) and if 
z & B{y; r) then 

d{x, z) < max{c/(x, y),d{y, z)} < r , 
so that z G B{x; r). In particular B{y; r) C B{x; r) for all y G B{x; r). Hence B{x; r) is open. □ 

Lemma 2. Let {X, d) be an ultrametric compact space. Then for every open set U G X, there 
is a clopen set Pu D U such that diam(P;7) = diam(i7). 

Proof: For 5 > let = {y ^ X ; dist(y, U) < 5}. By definition = \J^^^ B{x; 5) which 
implies U = CI^^q U^. In particular, is open. Since U is closed, it is compact and therefore 
there are {xi, . . . , Xn) in U such that U C B{xi;6) U • • • U B{xn', S) = Pjj C U^. Since every open 
ball is also closed, it follows that Pu is clopen. Moreover diam(?7) < diam(P;7) < diam(L''^). 
If now x,y £ , there are x',y' G U such that d{x,x') < 5 and d{y,y') < 6. There- 
fore d{x,y) < max{d{x,x'),d{x',y'),d{y',y)} < max{(5, diam(?7)}. In particular diam(C/'^) < 
max{5, diam ([/)}. If 5 < diam(?7), it follows that diam(P[/) = diam([/). □ 

Lemma 3. Let {C, d) be an ultrametric Cantor set. Then any finite cover by clopen set CP = 
(Pi,P2,--- ,Pn) admits a refinement Q = {Qi,Q2r ■ ■ ,Qn) such that Qj C Pj and Q is a c- 
partition. 

Proof: The set Qi = Pi \ (P2 U ■ • ■ U P„) is clopen and contained in Pi. It may happen that 
Qi = 0. But in any case, C = Qi U P2 U • • • U P„, while Qi fl P,- = for j > 1. Hence replacing 
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C hy Ci = C \ Qi and J" by IPi = T \ {Pi} the same argument follows. The Lemma is proved 
by induction. □ 

5.3. Proof of Theorem [Tj. Let V be the set of vertices of the Michon tree T. Thanks to the 
representation of c-partitions provided by the Proposition [6| each clopen set P C 87 can be 
written as P = Uj=ibj] where {vi,V2, • • • ,vi) G V are vertices which are pairwise not ancestor 
of each others. This means that [vi] Ci [vj] = if u j 7^ Vj. It follows that the least common 
ancestor f 1 A ^2 A • • • A f/ = up provides a measure of the diameter of P namely 

diam(P) = k{vp) , vp = ^ v . 

veV;[v]cP 

Let now IP be a c-partition J". Several clopen sets of J" might be associated with the same vertex 
vp. In particular 

diam(P)" > K{vy . 

In addition there might be clopen sets P,P' £ "P such that vp is an ancestor of vpi. In such a 
case diam(P)'* + diam(P')'* > K{vpy. Let then Q' = {[vp] ; P G CP}. Since [vp] D P, it follows 
that Q' is an open cover. In view of the previous remark, it might happen that [vp] D [vp']. 
By removing the smaller set, it gives a subcover. Let then Q be the minimal subcover obtained 
from Q' in this way. Then 'H'^(J') > T-L^[Q,') > ^-^[Q). In particular, Q is a c-partition made 
of vertices of the Michon graph. In addition, by construction, if diam(lP) < (5 it follows that 
k{v) < 6 for all f G V such that [v] G Q. Hence the Proposition |6] implies that Q is defined by 
the extremal vertices of a full finite subtree T G &s{7). □ 

5.4. Kraft's Identity. Let 7 = (V, £,0,k) be the reduced Michon tree of the ultrametric 
Cantor set {C,d). For each vertex v £ V, let M{v) be the number of its children. If 7 = (vq = 
0, i^i, • • • , Vn) denotes a finite path starting at the root, then M{'y) is defined by 

n-l 

M{j) = l[M{v,) 

j=0 

Proposition 7 (see f^). IfT is any full finite subtree of 7 then 

E = 1 (Kraft's identity) 

Terr ^ ^' 

where Tr denotes the set of all maximal paths in T starting at the root. 

Proof: The proof is based upon an elementary remark: if 7 = {vq, f 1, • • • , Vn-i,Vn) is a path 
in CPr, then the choice of Vn can only be made among the children of Vn-i- In particular, if 
Ch(t') denotes the set of children of v, 

1 

M(^ ptr 

since M{vn-i) is exactly the number of children of Vn-i- Since T is full, it follows that this 
identity applies also to paths in T. To vary 7 further, it is necessary to sum over the possible 
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vertices Vn-i, namely among the children of Vn-2- Proceeding this way inductively gives the 
result. □ 

Remark 2. The usual Kraft identity, in coding theory, deals with binary tree graphs, represent- 
ing the possible strings of binary digits. Therefore M(v) = 2 for all vertex v. Then the previous 
identity reads X^^gy^, 2~^"'^ = 1. Every maximal path can be identified with a code word. If no 
pair of code words share the same prefix, then the family of these code words lead to a finite 
subtree. This tree is full if the family of codewords is maximal among the set of codewords with 
no prefix in common. This prefix-free property makes this family of words recognizable when 
read. In particular a recognizable set of code words must be a subset of Tr for some full finite 
subtree of the binary tree. Hence if W is the set of such codewords and if i{w) denotes the 
length of the word w, the following inequality is necessary for recognizability 

2-^("') < 1 (Kraft's inequality) 

□ 

5.5. Proof of Corollary [ij Let F be a full finite subtree of T. Then, using the Kraft identity 
(see Proposition [T]) , it follows that 



ii{vY < sup k(v)"M(7„) < c(r M) 



On the other hand, let A'^ be the smallest natural integer such that c0^* < 6. Then there is a 
full finite subtree P of T such that v £ dT \v\ > Ng. In particular, P G (^^(T) and therefore 

niidJ) <c{e'M)^' . 

Since TV^ oo as ^ 0, it follows that if O^M < 1, then W{d7) = 0, namely s > dim/^(aT). 
This gives the result. 



6. 5'-ADic Systems 

6.1. Definitions. This Section is devoted to the basic definitions for S'-adic systems. The 
presentation uses the notations of Durand [12l[T3]. Let ^ be a finite alphabet. Note A* the set 
of all finite words with letters in A. For w G A*, \w\ denotes its length, namely the number of 
letters it contains. Consider S a finite set of morphisms 



a £ S : A{a) — > A*, with A{a) C A. 

A 5-adic system is a sequence {an '■ An+i — )• (j4„)*)„gf^ G S^, such that the morphisms are 
composable. It will be assumed that for all n, every letter in An appears in a word an{b) for 
some b G An+i- For m > n the following notation will be used 



Cn,m — o . . . o (Tm—l '■ Am >■ An- 

It satisfies fj^.m ° CTm.k = Cn,fc- A S'-adic system is primitive if there exists some sq > such that 
for all r G N, for all a G ^r+so ^^"^ b £ Ar, the letter b appears in the word ar^r+soiO')- It is 
called proper if there exist two letters I and r in A such that for all cr G 5 and all a G A(a), cT(a) 
begins by the letter / and ends by r. Furthermore, it will be assumed from this point on, that 
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(5) lim mill |(Ti„,(c)| = +00, 

Hence, words of arbitrary long length can be obtained from iterating the substitutions on a 
single letter. Given a S'-adic system, there is a way to associate a subshift S C (it is then 
called a 5-adic subshift). Let T be the shift operator on 

T{. . . X-i ■ XqXi ...) = ... X-iXq ■ Xi . . . . 

It is straightforward to extend the morphisms of S to by concatenation 

a{. . . x_i • xqXi ...) = ... a{x^i) ■ a{xo)a{xi) .... 
It ought to be remarked that for all n, cri,n(0 is a prefix of (Ti^„+i(0- Similarly, (Ti^„(r) is a suffix 
of (Ti^„+i(r). Therefore, an element of can be defined by 



X = [ lim cri,„(r) • lim ai^n{l) 

where the dot separates the Xi with i > on the right from the ones with i < on the left. 
The subshift associated with the 5-adic system is the closure in (endowed with the product 
topology) of the orbit of x under the shift. This subshift, H, is endowed with the combinatorial 
distance d, namely two sequences have distance (n + 1)~^ whenever they coincide on a string of 
radius n around the origin and do not coincide beyond. If the S'-adic system is primitive, (H, T) 
is minimal (see Durand |12l Lemma 7]). 

Definition 4. A word x G A^ is called linearly recurrent or linearly repetitive (LR for short) 
if there is a constant K such that for all n € N, for all subwords w and w' of x respectively of 
length n and Kn, then w occurs in w' as a subword. 

In other word^ in a LR word, each finite subword repeat infinitely often and within a distance 
which varies linearly with their size. It is easy to see that a subshift generated by an LR word 
is minimal and that every elements of the subshift is LR with the same constant. In this case, 
the subshift itself is called linearly recurrent. The following is due to Durand 

Theorem 8 (Durand [13], Proposition 1.1). A subshift is S-adic primitive and proper if and 
only if it is linearly recurrent. 

The periodic case is trivial. In the non-periodic case, Durand's construction of the S-adic system 
associated with a linearly recurrent subshift involves return words. The S-adic system he builds 
in this case has the following property, which will be used in the proof of the result below (see |13l 
Section 4] for the construction, and [14^ Definition 9, Lemma 17] for the properties of return 
words and codes). 

Definition 5. Let {o'n)neN a S-adic system. It is said to have unique decomposition property 
if, given any element x in the subshift associated with H, there is a unique decomposition of x 
as a concatenation 

x = ... q-i(a-i) (Ji(ao) o-i(ai) . . . , 
where the index Q of x is in the underlined word (7i(ao). 

^No pun intended. 
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6.2. Proof of Theorem |4]. The periodic case is trivial (in this case, H is finite). Using the 
theorem of Durand cited above, it is enough to prove the fohowing 

Proposition 8. Let 3 be a a primitive proper S-adic subshift with unique decomposition property 
and endowed with the combinatorial metric. Then, it is f -embeddable. 

It will be convenient to describe an S-adic subshift in terms of a Bratteli diagram (see for 
example |14]). A weight on the Bratteli diagram permits to define an ultrametric on the set of 
its infinite path. It will be necessary to prove 

- the ultrametric Cantor set associated with this Bratteli diagram is bi-Lipschitz homeo- 
morphic to the subshift with the combinatorial metric; 

- the weight on the Bratteli diagram satisfies the hypothesis of Theorem [3] so that the 
Cantor set associated with the diagram is embeddable. 

The following result is needed. 

Lemma 4 (Durand [T2], Lemma 8). If the S-adic system generated by (fT„)„gN is primitive with 
constant sq, there exists a constant K such that for all integers r, s, with s — r > sq and for all 
b, c in As^i 

kr,a+l(&)| ^ ^ 
Wr,s+l{c)\ ~ 

Let ((T„ : An+i — >■ An) be a primitive 5-adic proper system with unique decomposition property 
and let / and r the letters associated with the properness. The Bratteli diagram is defined as 
follows 

- For all n G N, the set of vertices V„ is in bijection with the alphabet An- for each a G An, 
there is a Va € Vn- 

- For all 71 G N, an edge in £„ is a triple e = {va,l, Vb) where Va G V„, vi, G Vn+i and / G N 
such that the letter a occurs in the word ct„(6) in position / + 1. Hence, the number of 
edges from Va to Vb is equal to the number of times the letter a appears in Onih). 

- If e = {va, I, Vb) G £n its source is s{e) = Va, its range is r(e) = Vb and its label is /(e) = I. 

Example 1. If an{b) = labcar, then there are two edges ei,e2 from Va to Vb- one corresponds 
to the second letter and the other to the fifth letter of the word above. Their respective labels 
are 1 and 4. □ 

Definition 6. A path on the Bratteli diagram is a sequence of edges {ek)i<k<j (with \ < i < 
j < +oo), such that e^ G £fc for all k and r(efc) = s(e/;+i). Let n„ denote the set of paths with 
i = 1, j = n (simply called "paths of length n"). Let H denote the union of all n„, and let Iloo 
be the set of paths of infinite length (i = 1, j = +oo). 

Because the o"„'s are taken from a finite set of substitutions S the cardinality of the sets and 
V„ is uniformly bounded in n. Let 

vjn = (min{|o-i,„(a)| ; a G An}) ^ 

It is a decreasing sequence which tends to as n — t- oo. The weight of a finite path 7 will be 
defined by Wn+i whenever 7 has length n. This leads to a metric on Boo defined by 

dw{x, y) = Wn+i, where n is the length of the longest common prefix of x, y. 
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Proposition 9. The subshift endowed with the combinatorial metric, and {Iloo,dw) are 

homeomorphic though a bi-Lipschitz homeomorphism. 

Proof: (i) Constructing a map E — )• IIoo. Let x = . . . x^i ■ xqXi ... in H. Let vi E Vi be the 

vertex corresponding to the letter xq. By the unique decomposition property, x can be written 
in a unique way as the concatenation 

x = ... a{x'^^) a{xQ) a{x'i) ... , 

with (x^iez G (^2)^ and such that the letter of x of index is in the underlined word o"(xo). 
Therefore, if x' is the word x' = {x[)i^i 

(6) x = T^a{x'), 

This index k corresponds to an occurrence of the letter xq in the word cr(a;g). That is, it 
corresponds to an edge from vi to where V2 G V2 is the vertex corresponding to Xq. Iterating 
this process of "de-substitution" leads to construct a sequence of words . . . and 

a corresponding sequence of edges ei, 62, . . . , e„. In particular '4'{x) = (ei, 62, • . •) defines a map 

(ii) ip is a bijection. The map tp has an inverse (j) : Hoo ^ which will be built explicitly. 
Let 7 = (ei, 62, . . •) be a path going through the vertices vi,V2, ■ . .■ Given a finite word w = 
{■wq, . . . , wi^i) G A*, let e be a symbol not in A (the "empty" symbol). Define then w G {Au{e})^ 
is the sequence 

iD = ...€€e-weee.... 

This is a way of seeing a word in A* as a (partially defined) element of A^. For fixed n, the 
sequence 

(7) Ml) = o ai o r'(^2) o . . . o an-2 o T'(-«-^^+^ (r (T„_i(a„) z) , 

where /(e) denotes the label of the edge e, can be seen as an element of {A U {e})^, namely as 
the sequence 

• • • ee (Ti,„_i(r) ai^njan) o-i,n-i(0 « • • • , 

where the letter of index occurs in the underlined word. Its exact position is determined by 
the labels of the edges. This implies 

(8) [0n.(7)]j / e for - |cri,„_i(r)| < i < |cri,„_i(/)|. 

By hypothesis, limn-s.+oo |<7i,n-i(fe)| = +00 and this is true, in particular, whenever b G {r,l}. 
Therefore, the limit as n tends to infinity of (pnil) is an element of A^, which is noted 4>{x). 
Using the definitions of (f) and ip (see eq. ([g]) & ([T])), it is straightforward that -000 = idn^- 
Therefore, tp is onto and 4> is one-to-one. 

Conversely, let x G H, and 7 = 'ip{x). It will be shown that ^(7) = x. For all n, x has a unique 
decomposition 



X = ... CTl,„(6_l)fTl,n(feo)o-l,n(ftl) ■ • • , 
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with bi G An- So by definition of "0 and (j), 

(9) 0„(7) = . . . e fTi,„_i(r) ai^n{bo) fTi,n-i(0 e . . . , 

and the two words coincide on the word fTi,n(&o) (it appears at the same position). Furthermore, 
by properness of the S'-adic system, an{b-i) ends with the letter r while an{bi) begins with the 
letter /. Therefore, o"i,n(^-i) has ai^n-iir) as a suffix and ai^n{bi) has cri,n-i(0 ^ ^ prefix. So for 
all i ^ X such that [(j)n{7)]i / one has [(l)n{'y)]i = iCi- Taking a limit and using hypothesis ([5]), 
0(7) and X agree everywhere. Therefore, (p and ip are inverse bijections of each other. 

(iii) (j) is bi-Lipshitz. Let 7, 7' G IIoo be such that the n first edges of 7 and 7' coincide. 
Then, by definition, (pnil) = 4>ri{n')- Thus (/)(7) and i;A(7') coincide for all indices i satisfying 
— |(Ti^„_i(r)| <i < |cri^„_i(/)|. Using Lemma[4]and the definition of Wn leads to 

Since S is finite, C = 'ca.a'K„^s a<^A(c7) is well defined. Then, a word of the form <ti^„(5) = 

(Ti.n_i o (Tn~i{h) is at most C times longer than the longest word of the form cri^„_i(c). That is, 
using again Lemma[4j {wn)~^ < CK{wn-i)~^ ■ If [(p{'l)]i = [</'(7')]« H ^ K{wn-i)^'^ , then in 
particular this inequality holds for \i\ < {wn)~^/C. So 

d(7, 7') < ^^n d(<^(7), (^(7')) < Cw^n. 

Conversely, let 7 and 7' coincide up to edge n, but differing on their (n + l)-th one. Then, by 
definition of (p 

4>{l) = • • •0-l,n+l(ft-l)o-l,n+l(f'o)o-l,n+l(bl) . . • 

(/((V) = . . .cri,„+i(6'_i)cri,„+i(6o)o-i,„+i(6'i) . . . 

where / 6q and the letter of index belongs to the word (Ti_„+i(5o) (respectively ai^n+ii^'o)). 
By the unique decomposition property, </>(7) and have to differ for indices i satisfying 

-max{|cri,n+i(6o)|, |cri,n+i(&o)|} < i < max{|<Ti,n+i(6o)|, |o-l,n+l(&o)l}' 

that is for 

-K{Wn)-^ < i < K{Wnr\ 

This proves that is Lipschitz. □ 

Proof of Proposition [oj In order to prove that (H, d) is embeddable in M^, it is sufficient to 
prove that (Ilocd^) is embeddable. It suffices to remark that the corresponding Michon tree 
has the finite paths in IIoo as vertices and edges given by pairs of path differing only by one 
edge. By the finiteness of 5, all paths in 11^ have a bounded number of extensions to paths of 
n„+i and this bound is independent of n. Hence the Michon graph has a bounded number of 
children per vertex. So, the only thing left is to show that the sequence of weights {wn)n&i on 
the Bratteli diagram is bounded above by a decreasing geometric sequence. 

First, remark that for all n, the alphabet has at least two elements. If one of the An had 
only one element, the subshift H would be periodic (this case is already ruled out, because then 
the embedding is trivial). Using primitivity, for all letter c G ^sq+I' I^so(c)| > 2. By iteration, 
for all k and all c G ^^sq+i 
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Wks,{c)\>2\ 

So Wkso — 2 '^^ Since {wn)n&N is decreasing, Wn < CX"' with A = 2"^/'^'' < 1 and C is a constant. 
Thanks to Theorem |3] it fohows that is /-embeddable. 

6.3. About the Conjecture [l| The previous proof rehes on the fact that hnearly repetitive 
subshifts have a good representation by Bratth diagrams. While they are not self-similar, there 
are only a finite number of substitutions involved, which allows to generalize the methods of [22j . 
In a recent article, Aliste and Coronel [Ij provide a good description of any linearly repetitive 
tiling space as the set of paths on a Bratteli diagram. It is possible in the linearly repetitive 
case, to have a Bratteli diagram such that the number of vertices and edges in each Vn or £„ 
is bounded uniformly in n. They also have estimates for the size of the patches associated 
with paths of size n. These estimates are analogues of Durand's Lemma |4] and allow to define 
reasonable weights on the Bratteli diagram. Therefore, the path space of this Bratteli diagram, 
endowed with the distance defined by the weight, is /-embeddable. However, there is no proof 
yet that the homeomorphism between this paths space and the tiling space is bi-Lipschitz. 
In Proposition [9| the bi-Lipschitz character of (j) is proved through using equation Q . This 
requires an estimate on the length of (Ti^„(bo), of cri^„_i(r) and of cri^„_i(/). This is a quantitative 
version of a property known as "forcing the border" . It is unclear to the authors whether or not 
Aliste-Coronel's construction satisfies a similar quantitative border forcing property. Here is a 
possible analogue of the border forcing property that would be needed, restated in the context 

of H. 

Definition 7. Let {an '■ ^n+i — ^ ^n.)neN be a sequence of substitutions in where Ai is a set 
of proto-tiles (compact subsets of M.'^ which are the closure of their interior). Let C, X > 1 be 
two constant such that for all n gN and all t G An+i, 

C-^A" < rint(t) < i?cxt(t) < CA", 

where nnt(i) 0,^(1 i?ext(i) ^'"e the inside and outside radius of the tile t, respectively. The system 
has quantitative border forcing property if there is a C such that, for all n, all tile t, and all 
tiling T, if T contains o"i^„(t), then the patch of T contained in the C X"^ -neighborhood of the 
patch ai^n{t) does not depend on T . 

This property means that the neighborhood of cTi^„(t) is forced, up to a distance which is 
controlled. In a recent article, Giordano-Matui-Putnam-Skau (TH], can describe a tiling space 
as an inductive system (a sequence of "zoomed-out" tilings, in the sense of [Ij), such that the 
size, and to some extend, the shape of the tiles involved is tightly controlled. In particular, 
the authors can avoid "flat tiles" in a certain sense. It is possible that their methods can be 
used to produce an inductive sequence associated with a tiling, which has a quantitative border 
forcing property. If this could be done in a way which is compatible with Aliste and Coronel's 
methods, the proof presented above could be adapted right away to the case of tilings of higher 
dimension. In the light of this discussion, it seems reasonable to expect that the Conjecture [T] 
could be proved using the ideas above. However, the technical difficulties, namely those induced 
by using GMPS's construction, would be quite important. 



7. Sturmian Sequences 
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7.1. Definitions and notations. First, a few facts need to be introduced about Sturmian 
sequences and their coding. This section fohows in part the presentation in [2], and proof for 
some of the results cited below can be found there. 

Definition 8. Given a sequence x = {xn)nez G on the finite alphabet A, define its language: 
£^n{x) = {finite words ai . . .an which appear in x}, and C{x) = Cn{x). 

neN 

Definition 9. Given x G A^, its complexity function px is defined by 

Vn G N, Pxin) = Card(£,i). 
When there is no risk of ambiguity, Px{n) is just noted p{n). 

It is known that if for some n, Pxin) < n, then the word x is periodic. Sturmian sequences 
can be defined using the complexity function: they are the aperiodic sequences which have the 
lowest possible complexity. 

Definition 10. A sequence {xn)nGZ is called Sturmian if its complexity function satisfies p{n) = 
n + 1, and x is not eventually periodic. 

In particular, a Sturmian sequence is a sequence on two letters (since p{l) = 2). These two 
letters are noted and 1. 

Proposition 10. The frequency of the letter 1 in a Sturmian sequence x 

Card{A; e [-n, n] ; Xfc = 1} 
freq^(l := hrn — — 

n-s>+oo zn + 1 

is well defined, and is an irrational number, noted a G ]0, 1[. 

Definition 11. The Sturmian sequence x is of type zero if the frequency of 1 is less than one 
half, and of type one if this frequency is more than one half. 

It is easy to see that in a Sturmian sequence of type 1, the words 10, 01 and 11 may appear, 
but not the word 00. The same statements holds for sequences of type 0, just exchanging the 
letters and 1. 

Given a Sturmian sequence x, there is naturally a subshift of {0, 1}^ associated with it. It is by 
definition the set of all sequences y such that C{y) = C{x). It can also be defined as the closure 
of the orbit of x in A^ (for the product topology). The two definitions are equivalent. It is of 
course shift-invariant (hence the name "subshift"), and it is well known that it is minimal. 
Note that if x has frequency of ones equal to a, so do all the elements of its subshift. In particular, 
it makes sense to write that a Sturmian subshift is of type or of type 1. Conversely, the subshift 
associated with x is exactly the set of all Sturmian sequences which have the same frequency of 
ones. Therefore, it makes sense to denote H(a) the subshift associated with a G (0, 1). 
Sturmian sequences can be recoded using the following substitutions. Define: 

f h^O , f h^Ol 

^0 ^ I 1 ^ 10 ^"^d cTi : I ^ ^ ^_ 

Proposition 11. For any Sturmian sequence x of type 0, there is a Sturmian sequence x' such 
that either x = o'o{x') or x = TaQ{x'). (T is the shift operator, see Section^) 
For any Sturmian sequence x of type 1, there is a Sturmian sequence x' such that either x = 
(Ti(x') or X = Tai{x'). 
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Note ^ this re-coding map x i— ?■ x'. It is worth noting that if x,y are two elements of the same 
Sturmian subshift, then and belong to the same subshift. Non-periodicity of Sturmian 
sequences implies that for any Sturmian sequence y of type (resp. 1), there is a A; such that 
^''{y) is of type 1 (resp. 0). 

Definition 12. Let x be a Sturmian sequence, and (<I>"(a;))„gN be the sequence of re-coded 
Sturmian sequences. By definition, for all n, 

u^^' oa\^ o ...o ,To^"+i(^.2"+i(x)) and (t^^ o o . . . o <j\^" {^'^'^ {x)) 

are in the same orbit as x. All the bn are positive, except maybe bo = 0. The sequence (6n)neN 
is called the multiplicative coding of the Sturmian sequence x. 

All Sturmian sequences in a same subshift have the same multiplicative coding, and an acceptable 
multiplicative coding determines uniquely a Sturmian subshift. 

7.2. Partial fraction decomposition and multiplicative coding. The properties of a Stur- 
mian subshift are closely related to the partial fraction decomposition of the number a (which 
is the frequency of ones in the subshift). 

Let a G M be irrational. Then a can be written uniquely a = oq + ao, with oq £ Z and 
ao G (0, 1). The Gauss map G : [0, 1] — [0, 1], applied to ao, generates the continuous fraction 
expansion 

G{a) = a{a) , a{a) = 

a 

where [x] denotes the integer part of x, namely the largest integer smaller than or equal to x. 
Hence 

a = oq H , ai = a{a) , ai = G{a) . 

ai + ai 

Iterating this formula gives rise to the continuous fraction expansion 




a = oo H , ttn+i = G(a„) , an = a(a„_i) n>l 

ai H 

a2 H 



1 

as + • • • + 



The standard notation is 



a = [ao;ai,a2, • • • ,«„,•••], 

and the a„'s are called the partial quotients of a. 

Definition 13. A number a has bounded type whenever, the sequence of its partial quotients 
is bounded. 

Having bounded type is an exceptional property. This is a theorem of Khintchine. See also 
Levy [26]. 

Theorem 9 (see jSl])- For almost every a G [0, 1] the sequence (an)neN, of partial quotients of 
a is unbounded. 
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One famous example of such a typical number is e = 2, 71828 • • • the continued fraction of which 
was computed by Leonhard Euler in 1737 [TB] namely 

e= [2,1, 2,1, 1,4,1, 1,6,--- ,1,1, 2Z,---] a^i-i = 21 , ag^.a = asi = 1 , ^>1- 

Many properties of a Sturmian subshift are determined by the arithmetic properties of the 
number a associated with it. The following theorem is proved in Hedlund and Morse's seminal 
paper. 

Theorem 10 (see [29]). Consider a Sturmian subshift H C {0,1}^, of parameter a. This 
subshift is linearly repetitive (see definition^ if and only if a has bounded type. 

An immediate consequence of this theorem and theorem |4] is the following result. 

Proposition 12. Let 3 be a Sturmian subshift with parameter a, endowed with the combinatorial 
metric. If a has bounded type, then S is f-embeddable. 

This result needs a converse statement: if a has not bounded type, then the associated Sturmian 



subshift is not /-embeddable. This will be proved in section 7.3 

One way to understand the deep links between the combinatoric properties of a Sturmian subshift 
and the arithmetic properties of its parameter a is the following. A Sturmian sequence x can be 
seen as the coding of an orbit of the rotation of angle a on the circle: there is {Iq, Ii} a partition 
of the circle, and s a point in the circle such that = if and only if (s + a mod 1) G Iq. The 
rotation on the circle can be related to the partial fraction decomposition of a on the one hand, 
and on the multiplicative coding of x on the other hand, to get to following result. 

Theorem 11 ((see [2j). Let x be a Sturmian sequence, and a = [0; oi, a2, . . .] be the frequency of 
1 inx. Then the multiplicative coding ofx is given by the partial quotients of {\ — a) / a = a~^ — l. 

It is straightforward that (1 — a)/a = [ai — 1; 02, 03, .. .]. In particular, a has bounded type if 
and only if the sequence of coefficients of the multiplicative coding is bounded. 

7.3. Non embeddability of certain Sturmian subshifts. This section is devoted to the 
proof of the following result. 

Proposition 13. Consider a Sturmian sequence x, with associated subshift H and associated 
parameter a. If a has unbounded partial quotients, then H (with the combinatorial metric) is 
not embeddable in a finite dimensional space. 

A Sturmian subshift H is well described combinatorially by the (bilateral) tree of words of its 
elements. 

Definition 14. Given a Sturmian subshift H, its un-reduced tree of words is defined as follows. 

- For all n > 0, define a sequence of refining partitions of H by: 

= {[y~n ■■■Vn] ] V ^^}, 

where [y-n ■ ■ ■ Vn] is the cylinder set of all words in H which coincide with y on indices 
—n < i < n. 

- The set of vertices of the tree is in bijection with the disjoint union of all the Vn- If 
X G Vn, the associated vertex is noted vx- 

- There is an edge between vx and vy if and only if X £ Vn, Y G Vn+i and Y G X. 

- Add a vertex v^, and an edge between and the vertices vx for X gVq. 

- The weight of the vertex vx is ^/in + 1) if X £ Vn (n > 0), and 1 otherwise. 
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The (reduced) tree of words is obtained from this tree by the reduction process defined in section^ 

One remark about this tree: if X G Vn, then diam(X) < n, for the combinatorial distance, with 
equality if and only if X is the non-trivial union of two distinct elements of Vn+i- Note that the 
vertices vx of such clopen sets X are exactly the vertices with two children: these are precisely 
the ones which are not dropped by the reduction process. 
This leads to the following proposition. 

Proposition 14. The boundary of the reduced tree of words associated with a Sturmian subshift 
S is bi-Lipschitz homeomorphic to the subshift (endowed with the combinatorial metric). 

Proof: The proof is almost tautological: given an infinite path in the tree, say 7, the sequence 
of vertices (wx„)neN defines a decreasing sequence of compact sets X„, the diameter of which 
tends to zero. Therefore, its intersection is not empty and consists of a single element {x}. 
Define 4'{'^) = x. 

Conversely, given x G H, there is a unique decreasing sequence of sets X„ G Vn-, such that for 
all n, X £ Xn (explicitly: X„ = . . . Then it defines an infinite path 7 G dT. Clearly, 
7 is the unique pre-image of x by 4>. 

The fact that (j) is bi-Lipschitz (and in particular, it is a homeomorphism) results from the 
remark above, on the diameters of the elements of Vn- n 

Lemma 5. Let y be a Sturmian sequence, assume that y = cJq"(z), with z a Sturmian sequence. 
Then y contains the words 10''"1 and 10^" 0. 

Proof: liy = (Tq"(z), then y is of type and z is of type 1. By iteration, it is straightforward 
that cj^"(0) = and a^"(l) = 1(0^"). Since z is of type 1, it contains the words 10 and 11. 
Therefore, y contains the words: 

1(0^")1(0''") and 1(0^")0. 

□ 

Proof of Proposition |13| Let x be a Sturmian sequence, with a the frequency of 1, and 
assume that a does not have bounded type. Then its multiplicative coding (6n)neN is an un- 
bounded sequence. 

Let y = (^^o+bi+--+^n-i {^j^Y and z = (y). Without loss of generality, we assume that z is of 
type 1, so that 

y = a^Q-{z). 

Then, using previous lemma, y contains the words lO'^'^l and 10^" 0. Therefore, for all 1 < A; < 
[6„]/2 — 1,2/ contains the words 00^'^ and 00^^ 1. 

Applying the substitution cr := ctq^ o . . . o ctq^^^ the sequence x contains the words 

a(0)cj(0)2'=a(0) and a{Q)a{idf^a{l). 

Let a be the last letter of (t(0) and a{l) (it is the same: if 60 7^ 0, 1 otherwise). Since cr(l) 
starts by 1 and (t(0) starts by 0, x contains the words 

(10) acj(0)V(0)*^0 and 0^(0)^(0)^. 

Then, let Xk = [acr(O)^ • C7(0)'^], where the dot separates the indices i <Q and i > 0. It is an 
element of 'Pfc|CT(o)|; where |(t(0)| is the length of (t(0). Let Vk be the associated vertex. From 



Equation (10), the vertices have two distinct children (in the un-reduced tree of words). 



therefore, they are elements of the reduced tree, and their weight is (A;|(t(0)|) ^. In particular. 
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this construction shows that there are two vertices u, v, such that the quotient of their weights 
is [bn/2] — 1, and their distance in the tree is [bn/2] — 1. 

This construction can be done for ah n. If (6n)neN is unbounded, this shows that the weights 
cannot satisfy the geometric decay condition of Theorem [3| and H is not embeddable. □ 



The results presented here (namely proposition 12 and |13[) provide a proof of Theorem^ It is 



actually possible to give a more precise version of it. 

Theorem 12. A Sturmian subshift H(a) is f -embeddable if and only if the irrational number a 
associated with it has bounded type. In particular 

(i) if a is a quadratic irrational, then H(a) is f -embeddable; 
(a) for almost every a G (0, 1), the subshift H(a) is not f -embeddable; 
(Hi) the boundary of'E{e) is not embeddable for e = 2.71828. . .. 

Proof: The first part of this result is theorem [5| Point (i) is a consequence of the fact that 
quadratic irrational have an eventually periodic (hence bounded) partial fraction expansion. 
Point (ii) is a consequence of the theorem of Khintchine on continued fractions, and point (iii) 
is a consequence of the explicit formula for the partial fraction decomposition of e. □ 



8. Random Trees 

Random trees are good candidates to provide examples of Cantor sets with typical properties. 
In the history of random processes, the Galton- Watson process has certainly be a seminal exam- 
ple. Initially, Sir Francis Galton was concerned by the possible disappearance of family names 
among the British aristocrats in the nineteen century. He addressed this problem in a published 
article. Soon after, the reverend Watson proposed a method to attack the problem and provided 
a solution [35j. Unfortunately his prediction was that family names where all disappearing with 
probability one, which was certainly not what Galton could observe. Galton encouraged Watson 
to search further. It took up to the 1930's and the work of Haldane and Steffensen before the 
complete solution was found (see [23] for a fascinating review). As it turns out, branching pro- 
cesses of the Galton- Watson type appears in many other situations. For instance the description 
of the nuclear chain reaction or of electron emissions in a photomultiplier tubes can be described 
faithfully by such a process (see pL9]). 

The Galton- Watson process, as it is called today |19| [3] starts from a root and defines the 
number ^0 of offspring as an integer valued random variable with distribution p. Each offspring 
will be represented as a vertex v of the first generation of the random tree and a new integer 
valued random variable ^t,, describing the number of offspings of u, is drawn with probability p 
so that all the ^t,'s, including the root, are stochastically independent. Each vertex v such that 
7^ 0, will have offsprings represented as vertices of the second generation. Proceeding in the 
same way for all descendants of the root, the branching process is built inductively. The main 
difficulty, for the purpose of the present paper, is that the resulting random tree has, in general, a 
non zero probability to be finite. Moreover, even if it is infinite, it has an infinite number of dead 
branches, namely finite maximal paths. This is because, any vertex has a nonzero probability 
to have no offsprings or to have only one. The problem of pruning the tree to reduce it, leads 
to some difficulties that are interesting from the probabilistic point of view but provides useless 
complications as a nice illustration of the core result of the present article. For this reason, 
the model will be simplified by requiring that the probabilities po and pi, that the number of 
offspring be zero or one, respectively, vanish. So, from now on with probability one, the number 
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of offsprings will be at least two. In this way, each vertex has at least two children, so that the 
resulting tree is already reduced. This is why this branching process will be called a Reduced 
Random Tree. 

8.1. Reduced Random Trees: a review. Let p = {pn)n>2 be the probability supported 
by [2, oo) C N describing the number of children of each vertex. Let be the number of 
descendant at the generation n. Following Watson's idea, it is convenient to introduce the 
generating function 



^'n(x) = ^Prob{Z„ = /} 

1=2 

The construction of the random tree implies the following formula, where Vn denotes the set of 
vertices at generation n, 

(11) Zn+l = 5^ • 

Consequently the conditional probabilities are given by 

Vxoh{Zn+i = l\Zn = k} = ^ Pji---Pjk- 

iiH — ^jk=i 

In particular, it shows that (Z„)„gN* defines a Markov chain. Moreover 

oo 

(12) Pn+l{x) = Pn{P{x)) , P{X) = Y.PnX'' ■ 

n=2 

Since p is a probability, the series defining P converges for < x < 1. In addition, i-'(l) = 1, 
m = -P'(l) = IE(0 represents the average number of offsprings, if it exists. With the present 
restrictions, it follows that m > 2. The function x G [0, 1] — )• P{x) £ [0, 1] is positive, increasing 
and convex, more generally all its derivatives, when they exist, are positive, meaning that P is 



completely monotone. From the recursion relation ( 12 ) it follows immediately that 



Pn{x)= PoPo---oP {x) 

n 

In particular, since P{1) = 1, it follows that E(Z„) = -P^(l) = for all n. Hence the number 
of descendants at generation n grows exponentially fast with the generation in the average. 

Let now J^n be the sigma algebra generated by the variables for v vertices of the generations 



k < n. A classical remark made by Doob |lTj . is that, thanks to the equation (11) defining the 
process, the family (Zn)„gN, satisfies 

E (Z„+i|J"„) = mZn . 
Namely Wn = Zn/m^ is a martingale. As shown earlier by Doob [lOj this implies 

Theorem 13 (See jl9l |3]). Let p = {pn)n>2 be the probability distribution for the number ^ 
of offsprings such that the average IE(,^) = m and the variance Var(^) = are finite. Then 
the sequence Wn = Zn/m^ of random variables is a martingale with respect to the increasing 
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sequence Tn of a -algebras. In particular it converges almost surely to a random variable W such 
that 

2 

E(W) = 1 , VariW) = . 

— m 

8.2. Proof of the Proposition [2j If the probability p lias an infinite support, given any 
integer M > 2, the probability Pm that a given vertex has more than M children is non zero. 
The construction of the Random Reduced Tree, can be seen by associating inductively with 
any vertex of generation n a string (&i,62r'' i^n) of integers so that 1 < bn < Cfei,-- ,bn-i ■ 
particular, since > 2 for all u 's almost surely the subset W C V made of vertices for which 
all 6j's belong to {1,2} is non empty and gives an infinite binary subtree. Since the random 
variables {^^ ; v £ W} are i.i.d., it follows that , given M > 2 

Prob{e^ <M;yveA} = {l- Pm)*^ ■ 
In particular the probability that all vertices in "W have < M vanishes. □ 

8.3. Random Weight. In order that the boundary of the rooted random tree T = (V, £,0) 
previously built becomes an ultrametric Cantor set, it is necessary to put a weight on each 
vertex. The previous construction suggests that the weight itself be random and Markovian as 
well. In order to do so, the following model of random weight is proposed: let (A^)^gv be a 
family of i.i.d in [0, 1] with common distribution p; then the weight k{v) will be given by 

k{v) = k[u) , if n = father of v , k(0) = 1 . 

In order that this defines a good weight, it is required that A^, 7^ with probability one, namely 
pjO} = 0. Moreover, in order that the weight converges to zero along any infinite paths with 
probability one, it will be required that />{!}< 1. It will be convenient to use the following 
generating function (Mellin transform) 

h{s) = [ X'p{dX) . 
Jo 

It is easy to see that h is decreasing, logarithmically convex, namely h{r]Si + (1 — ?7)so) ^ 
hiySiY h{soy~'^ for < < 1, and lims^oo h{s) = p{l}. 

8.4. Proof of Theorem [6[ Thanks to Theorem [7j the Hausdorff dimension of the tree can be 
computed from the following random variables 

(13) n'(r)=J2<vr, nt,= J2^i^y- 

where F is a full finite subtree of the random 7. In the following & will denote the set of finite 
full subtrees of 7. This set is ordered by the inclusion of the vertex sets. In addition, giving 
r,r' G &, then they admit a least upper bound T V T' and a greatest lower bound TAT', as 
can be checked immediately. This suggests to define a new family of cr-algebras: let Jr be the 
(T-algebra generated by the ^^'s and the At,'s where v £ and u G Vr \ dV. In particular, if F 
is the full tree associated with generation n, then J^n will denote the cr-algebra generated by the 
^u's with u vertex of generation k < n — 1 and by the A^'s with v vertex of generation k < n. 
With these notations, the following result holds 
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Proposition 15. With the assumptions made previously on the construction of the random 
rooted weighted tree (T, V, 0, k) the following results hold 

(i) For all s > 0, the family Yn{s) = T-L'^/m'^h{s)'^ is a martingale with respect to the family 
^n] of a -algebras. In particular it converges almost surely to a positive random variable 

JneN 

Y{s) such thatE{Y{s)) = 1. 

(a) There is tm > Sm defined as the unique solution of h{2s) = mh{s)'^ , such that 

(a) if s < tm then Yn{s) converges almost surely to a constant, 

(b) if s = tm, then, 

Var(y(t„)) = 1 - 1 + ^ , 
(c) if s > tm the random variable Y{s) has not a finite second moment. 

(Hi) //yo{l} < and if Sm is the unique solution of mh{s) = 1, then the sequence of random 

variables Y(T) = 'H^"'(T) defines a martingale with respect to the family [J^r ] of a -algebras. 

V / re© 

In particular it converges almost surely to 1. 



Proof: (i) From the definition of Tif^ in eq. (13), it follows that 



E(^^)=E| 2^ A' I = /i(s)E(e0) = m/i(s) 
iDech(0) 



Moreover, 



yUeVn V&Ch{u) 



•F n 



= h{s)E Y i^iuy^u = mh{s) ^ ^(^)' 

\«ev„ / «ev„ 

= mh{s)nf, . 

In particular, this calculation shows that E(l^-|-i(s) | = Yn{s). In particular it is a martingale 
w.r.t. the family J^n of cr-algebras. Since Yi{s) = T-L^/mh(s) it follows that E(yi(s)) = 1. 
Therefore E{Yn{s)) = 1 for all n's. The convergence of this family is the main result of the 
martingale theory |llj . 

(ii) The calculation of the variance will be done through the second moment of Hf^. By con- 
struction 



E 



(K+i)'i-^n)=iE[ Y <^r<^r E E KK'\^n\ . 



Let the terms with u ^ u' he considered first. Then, since the Xy^s are independent for different 
u's, it follows that 
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t)6Ch(u) v'eCh{u') 



= h{sfm\{uyK{u'y . 



If now u = u', this gives two terms: the first one are terms for which v ^ v' and the other ones 
are for v = v'. The same type of calculation leads to 



EUiur Y 

\ vech{u) 
Grouping these results, leads to 



J' n 



h{2s)mK{uf\ 



E ((K+i) I ^n) = h{sf Tx? (K) + {h{2s) - h{s f) + h{sf}nl' . 
Averaging on both sides gives 



n+l) 



{m {h{2s) - h{sf) + cj^ h{sf] m''h{2s) 



It is worth noticing that, thanks to the definition of h, the Cauchy-Schwarz inequality gives 
/i(s)^ < h{2s). This inequality is actually strict because p{l} ^ 1, so that A is not almost surely 
equal to one. Therefore 



Var(y„(s)) 



1 h{2s] 



1-(1 



^mh{syj \^ m'h{2s)^ 

It follows that, ii s < Sm, then h{s)m > 1. In addition an elementary calculation shows that 

the map g[s) = h(2s)/h{s)'^ is monotone increasing, that g{0) = 1 and lims^oo g{s) = p{^}~^- 
Therefore, there is a unique > such that m = g{tm)- Using the definition of s^, it is easy 
to show that Sm <tm- Hence 

(a) if s < tm, liiiin->oo Var(l^(s)) = 0, implying that Yn converges almost surely to a constant; 
this constant can only be the common average, namely lim„_i>oo ^(s) = 1, 

(b) li s = tmi then the variance converges to a finite value 



tr. 



lim \ax{Yn{tm)) 



1 + 



1 

m 



(c) if s > tm, then the limiting random variable Y{s) does not have a finite second moment, 
(iii) Let now F' C T be two fiiU finite subtrees of T. Then there is a decreasing sequence of full 
finite subtrees such that F' C Fj c • • • Ti C Fq = F, and such that Fj+i is obtained from Fj by 
the following procedure: each vertex v G dVi which is not in dV is removed and replaced by its 
father. It is clear that, if Fj is full, so is Fj+i. This leads to 
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E(^^(r)|^r,) =IE Yl ^'^y E ^ ^(^)^ 

\Meari;Ch{M)n9r'=0 vech{u) J u^drndv 

Thanks to the definition of Jti the r.h.s. becomes 

E (^"(r) I Jti) = mh{s) Yl '^('^y + E • 

Meari;ch(u)nar'=0 uedmar 
In particular, if s = Sm, namely if mh[s) = 1, this gives YiJ") = 'H^'^iT) so that 



E(^y(r)| Jrij =Y{ri). 

Proceeding inductively along the chain of Tj's, this gives 

E fy(r) I Tr) = Y{r') . 



Therefore the family (y(r))pgjj is also a martingale w.r.t. the Jt's- The martingale theorem 
then shows that it converges almost surely. Since the full tree with boundary V„ is a member of 
this family and since it has been shown that the variance converges to zero (because Sm < tm), 
the family converges to a constant almost surely. □ 



Proposition 16. Under the hypothesis of Proposition 15, the Hausdorff dimension of [dT^d^] 
is almost surely equal to Sm- 



Proof: Thanks to Proposition 15, T-L^^^iT) converges almost surely to 1. It follows that 

W."- = inf ^'-(r), ^ \im.Wr = 1- 

^ re0« .54.0 

Consequently, HI — )• oo for s < Sm and HI — )• for s > Sm- Hence dim//(9T, d^) = Sm- ^ 



Proposition 17. Under the hypothesis of Proposition 15, the Hausdorff measure of [87 ^d^) at 
the dimension s = Sm exists almost surely and is a random probability. 

Proof: In order to prove it, it is sufficient to consider the basis of clopen sets of the form [u] 
for u gV. It boils down to consider 



a calculation similar to the one made in the proof of Proposition 15, shows that the family of 
{^*'"(r; u) ; T £ G , u £ Vr} is also a martingale satisfying 



for all full finite subtree Tq with u G OTq and T D Tq. Therefore, the martingale Theorem 
implies that = limr ?^'^'"(r; u) exists and that it is a random variable. Since the set of 

vertices is countable, the set of probability zero on which the convergence does not hold can be 
chosen independently on u £ V. By construction 
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showing that, after taking the hmit, YluedVo f^ii"^]) ~ ^- Since this is true for any fuh finite 
subtree, the Proposition [6] shows that the same relation occurs if OTq is replaced by any finite 
partition of 87 by clopen sets. Since clopen sets generates the cj-algebra of Borel sets, it shows 
that fi defines a probability measure on 87. □ 
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